STAIR: A System for Topical and Aggregated Information Retrieval

نویسندگان

  • C. V. Krishnakumar
  • Krishnan Ramanathan
چکیده

© STAIR : A System for Topical and Aggregated Information Retrieval C.V. Krishnakumar, Krishnan Ramanathan HP Laboratories HPL-2009-51 STAIR, Search, Focused Crawling, Information Retrieval Web content has exploded dramatically in the last decade and search is becoming increasingly complex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIRThe System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document. External Posting Date: March 6, 2009 [Fulltext] Approved for External Publication Internal Posting Date: March 6, 2009 [Fulltext] Published and presented at the First International Conference on HCI, Allahabad, India. Jan 20-23, 2009 Copyright the First International Conference on HCI, 2009 STAIR : A System for Topical and Aggregated Information Retrieval C.V.Krishnakumar Krishnan Ramanathan Stanford University, California, USA. HP Laboratories, Bangalore, India Web content has exploded dramatically in the last decade and search is becoming increasingly complex. In the current search paradigm, the user has to enter the query and is immediately presented results that are typically accessed sequentially. However, there are scenarios where the above model is not appropriate, either because results being in consumable form is more important than immediacy of results, or because the it is difficult and time consuming to navigate the results in sequential fashion. In this work, we describe the architecture, implementation and utility of STAIRThe System for Topical and Aggregated Information Retrieval, that uses a variant of focused crawling and retrieves relevant information from the web. We present a new interface that selects search results from different search engines, ranks the results and presents the most relevant results as an aggregated PDF document.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل

بررسی تأثیرات ریشه‌یابی در بازیابی اطلاعات در زبان فارسی

Using the language-specific behavior in information retrieval systems can improve the quality of the retrieved results significantly. Part of the word that remains after removing its affixes is called stem. Stemming process can be used for improving the relevancy of the results in information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...

متن کامل

QEA: A New Systematic and Comprehensive Classification of Query Expansion Approaches

A major problem in information retrieval is the difficulty to define the information needs of user and on the other hand, when user offers your query there is a vast amount of information to retrieval. Different methods , therefore, have been suggested for query expansion which concerned with reconfiguring of query by increasing efficiency and improving the criterion accuracy in the information...

متن کامل

The Feasibility Study of Launching Book Recommendation System on the Basis of a Lending and Selling System of e-Books and Digital Taktab

Background:The study was conducted to achieve three axes of goals (users, publishers and the system) by way of objectives related to: A) Users - measuring the level of their satisfaction with Taktab system and also use of various methods of data retrieval;  B) Publishers - Measuring the level of their satisfaction with Taktab system and also their expectations of the existence of a recommending...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009